Abstract

The task of CDMA multiuser detection is to simultaneously estimate the binary symbols of
K synchronous users from N received base-band CDMA signals. Mathematically, this
can be formulated as an inference problem on a complete bipartite graph. In research on
graphically represented statistical models, it is known that belief propagation (BP) performs
exact inference in a time that grows polynomially with the system size when the graph
is free from cycles, although the computation required for general graphs grows exponentially
in the worst case [1]. In addition, several recent studies have revealed that BP can
also serve as an excellent approximation algorithm even if the graph has cycles, as long as they
are relatively long [2, 3, 4]. However, as there exist many short cycles in a complete bipartite
graph, one might suspect that BP would not perform well when employed
for multiuser detection.

The purpose of this paper is to dispel this suspicion. More specifically,
we show that appropriately applying the central limit theorem and the law of large
numbers to BP, which is one of the standard techniques in statistical mechanics, makes it
possible to develop a novel multiuser detection algorithm whose convergence is
considerably better than that of the conventional multistage detection [5], without
significantly increasing the computational cost. Furthermore, we provide a scheme to analyse
the dynamics of the proposed algorithm, which can be naturally linked to the equilibrium
analysis recently presented by Tanaka in [6].

1 Multiuser detection 

We will focus on a CDMA system using binary phase shift keying (BPSK) symbols and K random
binary spreading codes of spreading factor N with unit energy over an additive white
Gaussian noise (AWGN) channel. For simplicity, we assume that the power is perfectly controlled
to unit energy; the extension to the case of distributed powers is straightforward. Under these
assumptions, a received base-band CDMA signal is expressed as

$$y_\mu = \frac{1}{\sqrt{N}} \sum_{k=1}^{K} s_{\mu k} b_k + \sigma_0 n_\mu, \qquad (1)$$

where $\mu \in \{1, 2, \ldots, N\}$ and $k \in \{1, 2, \ldots, K\}$ are indices for samples and users, respectively.
$s_{\mu k} \in \{-1, +1\}$ is the spreading code with unit energy, independently generated from the identical
unbiased distribution $P(s_{\mu k} = +1) = P(s_{\mu k} = -1) = 1/2$, and $b_k$ is the bit signal of user $k$. $n_\mu$ is
a Gaussian white noise sample with zero mean and unit variance, and $\sigma_0$ is the standard deviation
of the AWGN. Using these normalisations, the signal to noise ratio is defined as $\mathrm{SNR} = \beta/(2\sigma_0^2)$,
where $\beta = K/N$. In the following, we assume that both $N$ and $K$ are large
while $\beta$ is kept finite.
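
As an illustration of the signal model in eq. (1), the following is a minimal NumPy sketch that draws one realisation of the spreading codes, bits and received signal. The function name `simulate_cdma`, the seed and the example sizes are assumptions made for this sketch, and `snr` is interpreted as the plain ratio $\beta/(2\sigma_0^2)$ defined above.

```python
import numpy as np

def simulate_cdma(N, K, snr, rng):
    """Draw one realisation of eq. (1); snr is the ratio beta / (2 sigma0^2)."""
    beta = K / N
    sigma0 = np.sqrt(beta / (2.0 * snr))        # invert SNR = beta / (2 sigma0^2)
    s = rng.choice([-1.0, 1.0], size=(N, K))    # spreading codes s_{mu k}
    b = rng.choice([-1.0, 1.0], size=K)         # bit signals b_k
    n = rng.standard_normal(N)                  # unit-variance noise samples n_mu
    y = s @ b / np.sqrt(N) + sigma0 * n         # received signals y_mu
    return s, b, y, sigma0

# Hypothetical example sizes (assumed here): N = 2000 samples, K = 1000 users.
rng = np.random.default_rng(0)
s, b, y, sigma0 = simulate_cdma(N=2000, K=1000, snr=4.0, rng=rng)
```

With this normalisation each $y_\mu$ has variance $\beta + \sigma_0^2$, which gives a quick sanity check on a simulated sample.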

The goal of multiuser detection is to simultaneously infer the bit signals $b_1, b_2, \ldots, b_K$ after
receiving the base-band signals $y_1, y_2, \ldots, y_N$. The Bayesian approach offers a useful framework
for such purposes. Assuming that the bit signals are independently generated from the unbiased
distribution, the posterior distribution given the base-band signals is provided as

$$P(b|y) = \frac{\prod_{\mu=1}^{N} P(y_\mu|b)}{\sum_{b} \prod_{\mu=1}^{N} P(y_\mu|b)}, \qquad (2)$$

where

$$P(y_\mu|b) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left[-\frac{(y_\mu - \Delta_\mu)^2}{2\sigma^2}\right] \qquad (3)$$

and $\Delta_\mu = \frac{1}{\sqrt{N}} \sum_{k=1}^{K} s_{\mu k} b_k$. Here $\sigma$ is the noise standard deviation assumed in the
posterior, which the formulas below keep distinct from the true value $\sigma_0$. Following the Bayesian
framework, one can systematically derive the optimal inference strategy from the posterior
distribution (2) for various cost functions. For instance, it can be shown that the bit error rate
(BER), which is the cost function that we will focus on in this paper, is minimised by the
maximiser of the posterior marginal (MPM) estimator

$$\hat{b}_k = \mathop{\mathrm{argmax}}_{b_k \in \{+1, -1\}} \sum_{b_{l \neq k}} P(b|y). \qquad (4)$$

Figure 1: Graphical expression of the CDMA multiuser detection problem. Each edge corresponds
to a component of the spreading codes $s_{\mu k}$.
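
For very small $K$ the MPM estimator (4) can be evaluated by brute force, which makes the exponential cost explicit. The sketch below enumerates all $2^K$ bit configurations under a Gaussian likelihood with assumed noise level `sigma`; the function name `mpm_exact` is hypothetical and the code is only intended as a reference for toy sizes.

```python
import itertools
import numpy as np

def mpm_exact(s, y, sigma):
    """Brute-force MPM estimate by summing the posterior over all 2^K configurations."""
    N, K = s.shape
    configs = np.array(list(itertools.product([-1.0, 1.0], repeat=K)))  # (2^K, K)
    delta = configs @ s.T / np.sqrt(N)                 # Delta_mu for every configuration
    log_w = -np.sum((y - delta) ** 2, axis=1) / (2.0 * sigma ** 2)
    w = np.exp(log_w - log_w.max())                    # unnormalised posterior weights
    w /= w.sum()                                       # posterior P(b|y)
    m = w @ configs                                    # posterior averages of b_k
    # P(b_k = +1 | y) = (1 + m_k) / 2, so the marginal maximiser is the sign of m_k.
    b_hat = np.where(m >= 0.0, 1.0, -1.0)
    return b_hat, m
```

The loop over configurations is hidden in `itertools.product`; doubling the table with every additional user is exactly the cost that motivates the approximation discussed next.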



2 Graphical expression and belief propagation 

Unfortunately, the cost of exactly computing the MPM estimator grows exponentially
with the number of users K in the current system, which implies that one has to
resort to an approximation in practice. Belief propagation (BP), or the sum-product
algorithm, is known as one of the most promising approaches to such tasks, although its performance
for densely connected systems, including complete bipartite graphs, has not yet been sufficiently
examined [7]. Here we investigate the efficacy of BP in densely connected systems by applying
it to the present CDMA multiuser detection problem.

In order to introduce this algorithm to the current system, let us represent the base-band and
bit signals by two kinds of nodes and connect them with an edge when they are related. Since the
conditional probability of $y_\mu$ (3) depends on all of $b_1, b_2, \ldots, b_K$, the posterior
distribution (2) can be expressed as a complete bipartite graph, as shown in Figure 1.

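Then, BP can be defined as an algorithm passing messages between the two kinds of nodes
through edges as

$$P^{t+1}(y_\mu|b_k, \{y_{\nu \neq \mu}\}) = \hat{\alpha}^{t+1}_{\mu k} \sum_{b_{l \neq k}} P(y_\mu|b) \prod_{l \neq k} P^t(b_l|\{y_{\nu \neq \mu}\}), \qquad (5)$$
$$P^t(b_k|\{y_{\nu \neq \mu}\}) = \alpha^t_{\mu k} \prod_{\nu \neq \mu} P^t(y_\nu|b_k, \{y_{\lambda \neq \nu}\}), \qquad (6)$$

where $t = 1, 2, \ldots$ is an index counting the number of updates, and $\hat{\alpha}^t_{\mu k}$ and $\alpha^t_{\mu k}$ are constants
fixed by the normalisation constraints $\sum_{b_k = \pm 1} P^t(y_\mu|b_k, \{y_{\nu \neq \mu}\}) = 1$ and
$\sum_{b_k = \pm 1} P^t(b_k|\{y_{\nu \neq \mu}\}) = 1$, respectively. The marginalised posterior at the $t$th update is
evaluated from $P^t(y_\mu|b_k, \{y_{\nu \neq \mu}\})$ as $P^t(b_k|y) = \alpha_k \prod_{\mu=1}^{N} P^t(y_\mu|b_k, \{y_{\nu \neq \mu}\})$, where $\alpha_k$ is a
normalisation constant.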

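As $b_k$ is a binary variable, one can parameterise the above functions as
$P^t(y_\mu|b_k, \{y_{\nu \neq \mu}\}) \propto (1 + \hat{m}^t_{\mu k} b_k)/2$, $P^t(b_k|\{y_{\nu \neq \mu}\}) = (1 + m^t_{\mu k} b_k)/2$ and
$P^t(b_k|y) = (1 + m^t_k b_k)/2$ without loss of generality, which simplifies the expressions (5) and (6) as

$$\hat{m}^{t+1}_{\mu k} = \frac{\sum_{b} b_k P(y_\mu|b) \prod_{l \neq k} \frac{1 + m^t_{\mu l} b_l}{2}}{\sum_{b} P(y_\mu|b) \prod_{l \neq k} \frac{1 + m^t_{\mu l} b_l}{2}}, \qquad (7)$$
$$m^t_{\mu k} = \tanh\left(\sum_{\nu \neq \mu} \tanh^{-1} \hat{m}^t_{\nu k}\right). \qquad (8)$$

Employing these variables, the approximated posterior average of $b_k$ at the $t$th update can be
computed as $m^t_k = \tanh\left(\sum_{\mu=1}^{N} \tanh^{-1} \hat{m}^t_{\mu k}\right)$.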
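
The leave-one-out sum in eq. (8) can be obtained for all pairs $(\mu, k)$ at once by subtracting each term from the full sum. The following NumPy sketch illustrates this; the function name `cavity_means` and the array layout (rows indexed by $\mu$, columns by $k$) are assumptions made for the example.

```python
import numpy as np

def cavity_means(m_hat):
    """Given an (N, K) array of hat-m_{mu k}, return m_{mu k} of eq. (8) and m_k."""
    a = np.arctanh(m_hat)                    # tanh^{-1} of every message
    total = a.sum(axis=0)                    # sum over all mu, shape (K,)
    m_cavity = np.tanh(total[None, :] - a)   # eq. (8): sum restricted to nu != mu
    m_post = np.tanh(total)                  # posterior average m_k
    return m_cavity, m_post
```

All entries of `m_hat` must lie strictly inside $(-1, 1)$ for `np.arctanh` to remain finite.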

3 Propagating beliefs in a large complete bipartite graph 

Since each base-band signal $y_\mu$ is connected with every bit signal $b_k$, evaluating
eq. (7) brings about a computational explosion when K is large, which implies that
performing BP exactly is hopeless in the current system. However, appropriately employing the
central limit theorem and the law of large numbers, which is a standard procedure in
statistical-mechanical analysis [8, 9], makes it possible to carry out the belief updates (7)
and (8) approximately on a practical time scale.

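Since $s_{\mu k} b_k/\sqrt{N}$ is small for large $N$, we expand the conditional probability in eq. (7) as

$$P(y_\mu|b) \simeq \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left[-\frac{(y_\mu - \Delta_{\mu k})^2}{2\sigma^2}\right] \left(1 + \frac{s_{\mu k} b_k}{\sqrt{N}} \, \frac{y_\mu - \Delta_{\mu k}}{\sigma^2}\right), \qquad (9)$$

where $\Delta_{\mu k} = \sum_{l \neq k} s_{\mu l} b_l/\sqrt{N}$. As the spreading codes are generated independently,
$s_{\mu l}$ and $b_l$ would be uncorrelated when $b_l$ is generated from $P^t(b_l|\{y_{\nu \neq \mu}\}) = (1 + m^t_{\mu l} b_l)/2$.
This, in conjunction with the central limit theorem, implies that $\Delta_{\mu k} = \sum_{l \neq k} s_{\mu l} b_l/\sqrt{N}$ obeys a
normal distribution $\mathcal{N}\left(\langle \Delta^t_{\mu k} \rangle, \beta(1 - Q^t_{\mu k})\right)$, where $\langle \Delta^t_{\mu k} \rangle = \sum_{l \neq k} s_{\mu l} m^t_{\mu l}/\sqrt{N}$ and
$Q^t_{\mu k} = (1/K) \sum_{l \neq k} (m^t_{\mu l})^2$. Furthermore, due to the law of large numbers, $Q^t_{\mu k}$ is highly likely to be
well approximated by $Q^t = (1/K) \sum_{k=1}^{K} (m^t_k)^2$. Substituting these, one can evaluate eq. (7) as

$$\hat{m}^{t+1}_{\mu k} = A^t \left(\frac{y_\mu s_\mu}{\sqrt{N}} - \beta \left(P_\mu - \frac{1}{K} I\right) m^t_\mu\right)_k, \qquad (10)$$

where $s_\mu = (s_{\mu k})$, $m^t_\mu = (m^t_{\mu k})$ and $A^t = (\sigma^2 + \beta(1 - Q^t))^{-1}$. Here, we also introduced the
projection and the identity matrices $P_\mu = (1/K)(s_{\mu k} s_{\mu l})$ and $I = (\delta_{kl})$, respectively. $(\cdots)_k$
denotes the $k$th component of the vector $\cdots$. Eq. (10) can be evaluated by $O(K)$ computations per
pair $(\mu k)$, which implies that $O(NK^2)$ computations are required in total per update.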

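The computational cost can be further reduced to $O(K^2)$ when $N$ is large by employing eq. (8).
As $\hat{m}^t_{\mu k}$ typically scales as $O(N^{-1/2})$, eq. (8) can be expanded as
$m^t_{\mu k} \simeq m^t_k - (\partial m^t_k/\partial \hat{m}^t_{\mu k}) \hat{m}^t_{\mu k} = m^t_k - (1 - (m^t_k)^2) \hat{m}^t_{\mu k}$.
Plugging this into eq. (10) provides a recursive equation with respect to $\hat{m}^t_\mu = (\hat{m}^t_{\mu k})$,

$$\hat{m}^{t+1}_\mu = A^t \frac{y_\mu s_\mu}{\sqrt{N}} - \beta A^t \left(P_\mu - \frac{1}{K} I\right) m^t + \beta A^t P_\mu C^t \hat{m}^t_\mu, \qquad (11)$$

where $m^t = (m^t_k)$ and $C^t = ((1 - (m^t_k)^2)\delta_{kl})$. Employing the useful relations $P_\mu C^t s_\mu = (1 - Q^t) s_\mu$ and
$P_\mu C^t P_\mu = (1 - Q^t) P_\mu$ and omitting negligible terms, the solution of eq. (11) can be expressed as

$$\hat{m}^{t+1}_\mu = R^t \frac{y_\mu s_\mu}{\sqrt{N}} - U^t_\mu + \frac{\beta}{K} A^t m^t, \qquad (12)$$

where $R^t$ and $U^t_\mu$ are obtained from the recursive equations

$$R^t = A^t + A^t \beta(1 - Q^t) R^{t-1}, \qquad (13)$$
$$U^t_\mu = A^t \beta P_\mu m^t + A^t \beta(1 - Q^t) U^{t-1}_\mu. \qquad (14)$$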
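
The two projection relations used in passing from eq. (11) to eq. (12) are exact algebraic identities for $\pm 1$ codes, and can be checked numerically. The snippet below is a small sanity check under assumed sizes; the variable names are chosen for this example only.

```python
import numpy as np

rng = np.random.default_rng(0)
K = 50
s_mu = rng.choice([-1.0, 1.0], size=K)     # one row of spreading codes s_{mu k}
m = np.tanh(rng.standard_normal(K))        # arbitrary values m_k in (-1, 1)

P = np.outer(s_mu, s_mu) / K               # projection matrix P_mu
C = np.diag(1.0 - m ** 2)                  # C^t = ((1 - m_k^2) delta_kl)
Q = np.mean(m ** 2)                        # Q^t = (1/K) sum_k m_k^2

lhs1, rhs1 = P @ C @ s_mu, (1.0 - Q) * s_mu    # P C s = (1 - Q) s
lhs2, rhs2 = P @ C @ P, (1.0 - Q) * P          # P C P = (1 - Q) P
print(np.allclose(lhs1, rhs1), np.allclose(lhs2, rhs2))
```

Both identities follow from $s_\mu \cdot C^t s_\mu = \sum_k (1 - (m^t_k)^2) = K(1 - Q^t)$, so the two comparisons should agree up to rounding.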

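Since $\hat{m}^t_{\mu k}$ typically scales as $O(N^{-1/2})$, the posterior average can be expressed as
$m^t_k = \tanh\left(\sum_{\mu=1}^{N} \tanh^{-1} \hat{m}^t_{\mu k}\right) \simeq \tanh\left(\sum_{\mu=1}^{N} \hat{m}^t_{\mu k}\right)$. This implies that the belief
updates (5) and (6) are finally summarised into

$$h^{t+1} = R^t h^0 - U^t + A^t m^t, \qquad (15)$$
$$U^t = A^t W m^t + A^t \beta(1 - Q^t) U^{t-1}, \qquad (16)$$

and eq. (13), where $m^t_k = \tanh(h^t_k)$, $h^0 = (h^0_k) = \left(\sum_{\mu=1}^{N} s_{\mu k} y_\mu/\sqrt{N}\right)$, $h^t = (h^t_k)$ and
$W = (W_{kl}) = \left(\sum_{\mu=1}^{N} s_{\mu k} s_{\mu l}/N\right)$. From the posterior average $m^t_k$, the MPM estimator at the $t$th update
is evaluated as $\hat{b}^t_k = \mathrm{sign}(m^t_k)$, where $\mathrm{sign}(x) = \lim_{\epsilon \to +0} x/|x + \epsilon|$.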
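
A minimal NumPy sketch of the update scheme defined by eqs. (13), (15) and (16) is given below. The function name `bp_detect`, the initial conditions $m^0 = 0$, $R^{-1} = 0$, $U^{-1} = 0$ and the number of iterations are assumptions made for illustration; the text does not specify them.

```python
import numpy as np

def bp_detect(s, y, sigma, n_iter=10):
    """Iterate eqs. (13), (15), (16); returns the bit estimates and posterior averages."""
    N, K = s.shape
    beta = K / N
    h0 = s.T @ y / np.sqrt(N)          # h^0_k = sum_mu s_{mu k} y_mu / sqrt(N)
    W = s.T @ s / N                    # W_kl = sum_mu s_{mu k} s_{mu l} / N
    m = np.zeros(K)                    # assumed initial state m^0 = 0
    R = 0.0                            # assumed R^{-1} = 0
    U = np.zeros(K)                    # assumed U^{-1} = 0
    for _ in range(n_iter):
        Q = np.mean(m ** 2)
        A = 1.0 / (sigma ** 2 + beta * (1.0 - Q))
        R = A + A * beta * (1.0 - Q) * R              # eq. (13)
        U = A * (W @ m) + A * beta * (1.0 - Q) * U    # eq. (16), the O(K^2) step
        h = R * h0 - U + A * m                        # eq. (15)
        m = np.tanh(h)
    b_hat = np.where(m >= 0.0, 1.0, -1.0)             # sign of the posterior average
    return b_hat, m
```

With these assumed initial conditions the first update reduces to $h^1 = A^0 h^0$, so the first estimate coincides with the sign of $h^0_k$; later updates refine it using the memory terms $R^t$ and $U^t$.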

Two points are worth noting. Firstly, the most time-consuming operation in eqs.
(13), (15) and (16) is $W m^t$, which requires $O(K^2)$ computations in total. This implies that
the computational cost of the current scheme is similar to that of the conventional
multistage detection [5]

$$\hat{b}^{t+1}_k = \mathrm{sign}\left(h^0_k - \sum_{l \neq k} W_{kl} \hat{b}^t_l\right). \qquad (17)$$

Secondly, as the fixed point condition, the coupled nonlinear equations

$$m_k = \tanh\left[\frac{1}{\sigma^2}\left(h^0_k - \sum_{l \neq k} W_{kl} m_l\right) - \frac{\beta(1 - Q)}{\sigma^2(\sigma^2 + \beta(1 - Q))} m_k\right] \qquad (18)$$

are obtained from our update scheme, where $Q = (1/K) \sum_{k=1}^{K} m_k^2$. This is identical to the
Thouless-Anderson-Palmer (TAP) equation for the current system known in statistical
mechanics [10]¹. However, it should be emphasised that the naive iteration of eq. (18) does not
serve as a useful detection algorithm, because it is difficult to find the fixed point in this way
from a reasonable initial state. This will be illustrated by numerical experiments in the final section.
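
For comparison, single update steps of the multistage detection (17) and of the naive iteration of the TAP equation (18) can be sketched as follows. The function names `msd_step` and `tap_step` are hypothetical, and the synchronous (all users at once) update order is an assumption of this sketch.

```python
import numpy as np

def msd_step(b_prev, h0, W):
    """One multistage detection update, eq. (17)."""
    interference = W @ b_prev - np.diag(W) * b_prev    # sum over l != k of W_kl b_l
    return np.where(h0 - interference >= 0.0, 1.0, -1.0)

def tap_step(m_prev, h0, W, sigma, beta):
    """One naive iteration of the TAP equation, eq. (18)."""
    Q = np.mean(m_prev ** 2)
    interference = W @ m_prev - np.diag(W) * m_prev    # sum over l != k of W_kl m_l
    reaction = beta * (1.0 - Q) / (sigma ** 2 * (sigma ** 2 + beta * (1.0 - Q)))
    return np.tanh((h0 - interference) / sigma ** 2 - reaction * m_prev)
```

Both steps cost $O(K^2)$ because of the product with $W$, the same order as one update of eqs. (13), (15) and (16).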



4 Density evolution and equilibrium analysis 

The density evolution is a framework to analyse the dynamical properties of BP by tracking a
macroscopic distribution of messages [11, 12]. In the current system, this analysis is considerably
simplified, as the aligned field $b_k h^t_k$ is likely to obey a normal distribution as a result of the central
limit theorem.

¹ In statistical physics, the pattern ratio $\alpha = N/K = \beta^{-1}$ and the inverse temperature $\beta\sigma^{-2}$ are usually
employed for characterising a system instead of $\beta$ and $\sigma^2$.

Let us assume that $b_k h^t_k = b_k \sum_{\mu=1}^{N} \hat{m}^t_{\mu k}$ is independently sampled from a normal distribution
whose average and variance are $E^t$ and $F^t$, respectively. This implies that the
overlap $M^t = \sum_{k=1}^{K} b_k m^t_k/K$ and $Q^t$ are evaluated as

$$M^t = \int Dz \, \tanh\left(\sqrt{F^t} z + E^t\right), \quad Q^t = \int Dz \, \tanh^2\left(\sqrt{F^t} z + E^t\right), \qquad (19)$$

where $Dz = dz \, \exp[-z^2/2]/\sqrt{2\pi}$. Since the MPM estimator is given as $\hat{b}^t_k = \mathrm{sign}(h^t_k)$, the BER is
provided as $P^t_b = \sum_{k=1}^{K} (1 - \mathrm{sign}(b_k h^t_k))/(2K) = \int_{E^t/\sqrt{F^t}}^{\infty} Dz$.

On the other hand, as far as $b_k h^t_k$ is independently sampled, $b_k \hat{m}^{t+1}_{\mu k}$ evaluated from eq.
(10) is uncorrelated for a given $k$, since spreading codes are almost orthogonal to each
other when $N$ is large, as $s_\mu \cdot s_\nu/N \sim O(N^{-1/2})$ holds for $\mu \neq \nu$. This implies that the central
limit theorem holds for $b_k h^{t+1}_k$, which, in conjunction with the statistical uniformity with
respect to the indices $\mu$ and $k$, provides the average and the variance at the $(t+1)$st update as
$E^{t+1} = (1/K) \sum_{k=1}^{K} \sum_{\mu=1}^{N} b_k \hat{m}^{t+1}_{\mu k} = (1/K) \sum_{\mu=1}^{N} b \cdot \hat{m}^{t+1}_\mu$ and
$F^{t+1} = (1/K) \sum_{k=1}^{K} \sum_{\mu=1}^{N} (b_k \hat{m}^{t+1}_{\mu k})^2 = (1/K) \sum_{\mu=1}^{N} \hat{m}^{t+1}_\mu \cdot \hat{m}^{t+1}_\mu$, respectively.
Evaluating these employing eqs. (1) and (10), $E^{t+1}$ and $F^{t+1}$ are obtained as

$$E^{t+1} = \frac{1}{\sigma^2 + \beta(1 - Q^t)}, \quad F^{t+1} = \frac{\beta(1 - 2M^t + Q^t) + \sigma_0^2}{[\sigma^2 + \beta(1 - Q^t)]^2}, \qquad (20)$$

where we assumed that $(1/K) \sum_{k=1}^{K} b_k m^t_{\mu k} \simeq M^t$ holds as a result of the law of large numbers.
Eqs. (19) and (20) express the density evolution with respect to the current algorithm.
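
The density evolution defined by eqs. (19) and (20) is a two-parameter recursion and can be iterated numerically in a few lines. The sketch below evaluates the Gaussian integrals on a fine uniform grid and the BER through the complementary error function; the function name `density_evolution`, the grid, and the initial condition $M^0 = Q^0 = 0$ are assumptions made for this example.

```python
import math
import numpy as np

# Uniform grid for the Gaussian measure Dz (assumed resolution, truncated at |z| = 10).
Z = np.linspace(-10.0, 10.0, 4001)
WZ = np.exp(-Z ** 2 / 2.0) / math.sqrt(2.0 * math.pi) * (Z[1] - Z[0])

def density_evolution(beta, sigma, sigma0, n_iter=20):
    """Iterate eqs. (19) and (20); returns a list of (E, F, M, Q, BER) per update."""
    M, Q = 0.0, 0.0                                    # assumed initial state
    history = []
    for _ in range(n_iter):
        A = 1.0 / (sigma ** 2 + beta * (1.0 - Q))
        E = A                                                       # eq. (20)
        F = (beta * (1.0 - 2.0 * M + Q) + sigma0 ** 2) * A ** 2     # eq. (20)
        th = np.tanh(math.sqrt(F) * Z + E)
        M = float(np.sum(WZ * th))                                  # eq. (19)
        Q = float(np.sum(WZ * th ** 2))                             # eq. (19)
        ber = 0.5 * math.erfc(E / math.sqrt(2.0 * F))   # integral of Dz from E/sqrt(F)
        history.append((E, F, M, Q, ber))
    return history

# Example with assumed values beta = 0.5 and sigma = sigma0 = 0.25.
for E, F, M, Q, ber in density_evolution(0.5, 0.25, 0.25, n_iter=5):
    print(f"E={E:.4f}  F={F:.4f}  M={M:.4f}  Q={Q:.4f}  BER={ber:.3e}")
```

When the assumed and true noise levels coincide ($\sigma = \sigma_0$), the recursion keeps $E^t = F^t$ and $M^t = Q^t$ at every update, which provides a convenient consistency check on an implementation.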

It should be noted that the obtained expression of the density evolution directly links the
proposed algorithm to the equilibrium analysis presented in [6], since eqs. (19) and (20) can be
regarded as the naive iteration dynamics of the saddle point equations provided by the replica
method [13]. This implies that, when $K$ is large, our algorithm can practically calculate the MPM
estimator (4) in $O(K^2)$ computations by reaching the fixed point solution, since the replica
analysis is likely to evaluate the exact performance for $K \to \infty$, although this has not yet been
rigorously proved.



5 Method comparison and discussion 

In order to validate the obtained results, we performed numerical experiments in a system with
$N = 2000$ and $\beta = 0.5$. Figure 2 shows the time evolution of the BER obtained from 10000 experiments
for the proposed algorithm (eqs. (13), (15) and (16): PA), the conventional multistage detection
(eq. (17): MSD), the iteration of the TAP equation (eq. (18): TAP) and the density evolution
(eqs. (19) and (20): DE).

Firstly, it is clear that PA converges to the fixed point considerably faster than MSD, which is
a highly desirable property in practical use. Secondly, PA and DE exhibit excellent consistency,
as we speculated in the previous section, which validates the use of the central limit
theorem and the law of large numbers in deriving eqs. (19) and (20). Finally,
TAP does not serve as a useful detection algorithm. This is because the iteration of eq. (18) does
not correctly approximate BP and, therefore, cannot sufficiently cancel self-reactions from the
past states in the transient dynamics, although it does provide the correct fixed point condition
in the stationary state.

In summary, we have developed a novel algorithm for the CDMA multiuser detection from 
the belief propagation appropriately employing the central limit theorem and the law of large 
numbers. The new algorithm exhibits considerably faster convergence than the conventional 
multistage detection without increasing the computational cost significantly and is likely to 
practically provide the optimal MPM estimator when the spreading factor N is large. We have 
also clarified the relation between the obtained algorithm and the existing equilibrium analysis 
presented in [6] employing the density evolution scheme. 




Figure 2: Time evolution of BER for the proposed algorithm (PA:D), the conventional multistage
detection (MSD:x), the naive iteration of the TAP equation (TAP:+) and the density evolution
(DE: lines) in the case of $N = 2000$, $\beta = 0.5$ and SNR = 4, 9 (data of TAP are shown only
for SNR = 9). Each marker represents the averaged BER at the $t$th update evaluated from 10000
experiments. PA exhibits the fastest convergence and excellent consistency with DE.



We have here assumed randomly generated spreading codes for simplicity, which might not be 
suitable for practical use. Extension of the current scheme to other methods of code generation 
is under way. 